NXT Heparin Affinity Glyc. grad.
FIGURE 9

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA
1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC
1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
1401 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG
1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA
1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT
1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 



FIGURE 9 (cont.) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG
3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
FIGURE 10

1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLVVS
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC
1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT
1751 GGGGTTTTGG GG 
CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA

1 + <- * 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT

a PQNPKTPKPL*KKKKLR*FR

b PKTPKPQNPYKKRKNCGSLE

c PKPQNPKTPIKKEKIEVV*K-

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT

61 * ♦ * * 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA

a NKILFPHKWRWILIWMI*KI
b IKYYSRTNGDGYCFGCYRKF-
c *NIIPAQMEMDIDLDDIENL-

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA

121 * * * 180

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT

a YFLIHSTSIAALVVTRKDAK

b TS*YIQQV*QLL*CQERMQN-

c LPNTFNKYSSSCSDKKGCKT-

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG

181 * * 240

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC
a HCNLARNRLHCLFQSCKNN*

b IEIWLEIAFIDYSKVAKTIR-

c LKSGSKSPSLTIPKLQKQLE-



S; ACTTCTACTTCTCGCATGCAAATCTTTATAACGATTCTTTCTTG 

241 - - ♦ * 300 

j*5 j TC AAG ATG AAG AG CCTACGTTT AG AAATATTGCTAAGAAAGAACTCTTTTAATC AAAATT 

a SSTSRMQIFITILSCEN'F- 

b VLLLGCXSL * RFFLEK I SFX 

C FYFSOANLYNDS T LRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 - 360 

TTTCCCCTCTCGTTTCTCATCTTTAACTTTCTAATGATTACAAATTTATTTTAGTCCATT 

a KAESXE'KLKHY ' C L N K I R • 

b XRRAXSRNCNITNV* t K S G N - 

C SGEQRVRIETLLMFK * N Q V M - 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAG AAAATT ACTTAA 

361 42C 

ACTCCTAATAAGATAAAAAATCTAGTCAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a CCLFYFLDHFLRS I MERIT- 

b EDYSIF* ITS "GALWRKLLN- 

C Rl ILFFRSLLKEH YG E^fYLI~ 

TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTCTTACTACTCATATAATTTAA 

a Y-KVMSLOYFPSQQCCVY'I 

b TKR'TVWI ISLANNDEYIKF 

c LKGKQFGUFP * PTMMSILNS- 
CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT

481 -- - - * * 540 

GT AT ACTCTT ACTC AGTTTCCT AGAGCT ATGT AGTCTG AATGGTTTCTGTTTGAGCGAT A 

a HMRMSQR IS I HQTYQRQTRY 

b ICECVKCSRY IRLTKDKLAI - 

C YENESKDLDTSOLPKTNSL * - 

AAAACGCAAGAAAAAGTTTCATAATCGAACAGCAGAAGAACTT 

541 — 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAA 

a KTQEKVC * S H SRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR- 

c N A R K S L I I EOOKNLLHLLFV- 

TATGCGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 " ♦ * 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCT^ 

a YGFYYNC FRY R RCTPESCDN-- 

b MGFITIVLG I DGELPSLETI 

C WVLLOLF-VSTVNSRVLRQL- 

M TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTCAAAGTTCTGATX5 

p 661 - * * 720 

p; ACTTTTTCGACAAATGTTGACTTCCTTACCGTCAAGACTTTCAAGACTACACATACGGTA 

U\ «" CKSCLOLKESQFCKF-CVCH 
%\ b EKAVYNCRNRSSESSDVYAI 
# s c KKLFTT EG I AVLKVLMCMPL- 
TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA



y.\ ATAAAACACTTAATTAGACTTTATAGAATAGAGTTAAATTACCTATCGA 

* * YFVN'SQISYLNLMDSYRNK 

f»J b ILCINLKYLISI'WIAIETN- 

f J C FCELISNILSQFNG-L-KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC
GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG

M a PNKPCKrNCIYVKSFGTNAH 

FH b OINHASLMEYTLNPLGQMHT 

' " c K * T - 0 V • W N I R • I L W D K C T L 
TGAATTTATATTGCATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC
ACTTA*AATA - AACCTAAG -ATTTCCTATCTATGTCTCTTACGAAATCTCTGACTAAATCG 



a CIYtCFLXHRYTECFRDCFS 

° ^--ti-^S-SIOTONALETD'LA- 

c N L Y ! L X A • I H R M L • R L I • L - 



TTACAACACATTACCTGTTTTCATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 
AATGTTCTCT^TGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 



a LQQiTCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL'KKQ- 

c T T 0 Y L F C LLL L I SY I FKRSR- 



GGCGAAATGAAAACAAGACTAAAGAAAGAGATTTCAAAATTTCTTGATTCTT 
CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 
C Z M X R ft L X K E I S K F V D S S V T 
C *NEKK?XERDFKICCFFCNR
TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG

1081 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCT 

a LKISKIPGKRDTFI K IH I L " 

b "RFQKFQVREIHSLKFIYYS- 

C KDFKNSR • ERY I H • N S Y I I V - 



TTTTTCATTTCACAGCTGTTATTTTCTTT^ 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 



a FFISQLLFSFILTIFFD * LE 

b FSFHSCYFLLS * Q Y F L I S W K - 

FHFTAVI FFYLNNI FCLAGS- 



GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 
CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTC 



a vksik-ekr*tevt-lihih' 

b *KVSNKRSARLR * L S L F T F I 

KKYQIREALDCCNLAYSHS- 



f?i AGATCCACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

K 1261 * — — 1320 

C | TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATT*TTT 

W. a RSTFIYPIRC*GNSSHPF-K 
£ b DRPSYIQYDDKETAV IRFKN 

mi c IDLHISNTMIRKQQSSVLK I - 

hi TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAG 

5 , 1321 * ♦ * * * 1^330 

N ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTT^ 

L * •CYED'IFRVKKWSRNLNOK 
i»] b SAMRTKFLESRNGAE I L IKK 

Li C VLCGLNF • SQEMEPKS * SKR- 

H GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 
ffl 1381 - 

J, CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 
a ELRHYCKR IEL" IFR * " V L P 

MCVDIAKESNSKSFVNKYYO- 
c lASILQKNRTLNLSLlSlTN- 

ATCTTGATTCATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 
U41 ? _ isqo 

TAGAACTAAC7AACTTCTCTAACTGCTCCGTTGACGTGTCTTCTACTAATTTCTTTATTT 

ILIDCRD* RGNCTEDH • R N X 
SCLZEEZOEATAQKIlXEl'x- 
LOCLKRLTRQLHRRSLXK • 5- 

1501 GTAACTTTTA ™ A ^ AGAGAATA ^ 

C ATTGAAAATAATTAATC TCTT ATTTG ATTT AATG ATTAT ATCTCT AGTCGCT AC AAG TT 1560 

* V TF!N-RIN-ITNIEISOLO 
b *^LLlRE-TKLLr-RSAIF N. 
C NfV*LENKLNY-YRDORSSl- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTCCTCAAAAT 
1561 ♦ «. . _ 1620 

AACTGCTTTATTTTCG ACTTG A TTTC AATCTGTTATTTTTT ATGTTTGG AACC AGTTTT A 

* L T K " X L N ' S * T I KNTN LGON 
b CRNXSCTKVRO-KlQTLVKr- 

DEIXAELKLONKKYK P W S K Y - 

ATTGAGG AAGGAAAAG AAG AC C ACTT ACC AAAAG AAAAAAT AACGC AAT AAAT AAAATC A 
- 1680

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCCTTATTTATTTTACT 
GTACACAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT

L6B1 - — ♦ ♦ ♦ * 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTA 

a VQKCRWKRFIFFNNLLKRCV 

b YSSEEIKDLFFSIIYCKEGF- 

c TEVKK * K I Y F* F Q * FI EKRGF- 

TTCCGCTTTTCCCCTTTTGCCG 

U41 1762 

AACCCCAAAACCCCAAAACCCC 
FIGURE 19



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
241 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca aaggacacac
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg
1801 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc
2401 aaagaacaaa aaagattaaa a 
FIGURE 21

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1861 gtatacmt aagtagaacg aatmaatt taataacgtt aaaagtgcaa aaattgaatc
1921 ttccfcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga
2101 aaatgtatct atcaactgta ttcttgatca gcatatactl aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac
2401 cctttagcta alagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatl ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat
2701 tgaatatttc tttgcttatl atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL

PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKIARNEDVNNSLFCHS

ANVNVTLLKGAAWKMFHSLVGTYAFVDLLrNYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKJLPSSSSIKKLTDLR 

EAIFPTNLVIGPQRLKVRINLTLQKLLKJRHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKFIIVILQKXLPQEMFGSKKNKGKIIK1UNLLLSLPLNGYLPFDSLLKKL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRHDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRIIPKKSNNEFR

IIAIPCRGADEEEFTIYKENHKNAIQPTQKILEYLRNKRPTSFTKIYSPTQIADRIKE

FKQRLLKKFNNVLPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN

TNTGVLKLFKmtASRVPKPYELYIDNVRTVHLSNQDVINVVEMEIFKTALWVEDKCY 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

rNIKKLAMGGFQKYNAKANRDKJLAVSSQSDDDTVIQFCAMmFVKELEVWlOiSSTMN 

NFHIRSKSSKG1FRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSINVTQNMQFHSFLQRJIEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNHLLRKEIQHLQAYIYIYIHIVN 
LCVSY1LSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT
LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
Motif 0

human AKFLHWLMSVYVVELLRSFFYVTETTFQKNR

tez1 ISEIEWLVLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNR

EST2 LKDFRWLFISD IWFTKHNFENLNQLAICFISWLFRQLIPKI IQTFFYCTEISSTVT-

p123 TREISWMQVET-SAKHFYYFDHEN-IYVLWKLLRWIFEDLVVSLIRCFFYVTEQQKSYSK

* * * * * * 

• m • • • • • 

Motif 1 

human LFFYRKSVWSKLQSIGIRQHLKRVQLRDVSEAEVRQHREARPALLTSRLRFIPKP--DGL

tez1 TVYFRKDIWKLLCRPFI-TSMKMEAFEKINENNVRMDTQK-TTLPPAVIRLLPKK--NTF
EST2 IVYFRHDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLS--NFNHSKMRIIPKKSNNEF

p123 TYYYRKNIWDVIMKMSI-ADLKKETLAEVQEKEVEEWKKS-LGFAPGKLRLIPKK--TTF

* • * * * ** 

i m * 0 • • • 

Motif 2

human RPIVNMDYVVGARTFRREKRAERLTSRVKALF-SVLNYERA

tez1 RLITN-LRKRFLIKMGSNKKMLVSTNQTLRPVASILKHLINEESSGIPFNLEVYMKLLTF

EST2 RIIAIPCRGADEEEFTIYKENHKNAIQPTQKILEYLRNKRPTSFTKIYSPTQIADRIKEF

p123 RPIMTFNKKIVNSDRKTTKLTTNTKLLNSHLMLKTLKN-RMFKDPFGFAVFNYDDVMKKY

! ■ * * * * 

^ • 

Motif 3 (A)

tez1 KKDLLKHRMFGR-KKYFVRIDIKSCYDRIKQDLMFRIVKK-KLKDPEFVIRKYATIHATS

EST2 KQRLLKKFNNVLPELYFMKFDVKSCYDSIPRMECMRILKD-ALKNENGFFVRSQYFFNTN

p123 EEFVCKWKQVGQPKLFFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKN

* ***** * *

• • • * m 

ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR

AERLTSRVKALFSVLNYERA 
GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 
MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLE1IGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKWVFEETVSKKRK 

RTIETSITQNKSARKEVSWNSISISRFS1FYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPYIDTHDDEK1LS 

YSLKPNQVFAFLRS1LVRVFPKLIWGNQRJFEI1LKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 

GKRSNAKMCLSDFE1GIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKINEhflWRMDTQKTTLPPAVIRLLPKXhnTRLITOLRKRFLIKMGSNKKMLVSTO 

LRPVASILKHLINEESSGIPF>ILEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

ivkkklkdpefvirkyatihatsdratknfvseafsyfdmvpfekvvqLlsmktsdtlfvdfvdy 
wtkssseifkmlkehlsghivkignsqylqkvgipqgsilssflchfymedlideylsftkkkgsvl 
lrvvddflfitvnkkdakkflnlslrgfeiowfstsle^ 

fsvnmrsldtllacpkidealfnstsveltkhmgksffykilrsslasfaqvfidithnskfnsccni 

YRLGYSMCMRAQAYLKJIMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 
ggtaccgatttacmcctncncataagctaattgcttcctcgaacgctcctaaatctctggaaatatttuacaagaactcaataacaataccaagtcaaattccaatatgaagg

tgttattagtgatcgataatamctattttatcggtcgttaccaagtataa 

ggttcgcttactmaatcgtggtactgtmagctgctacttctagccaa^ 

ctgatgagactatanagancanacagtccgtgcatancttaacatggagccttacactttagatgagtcacgtcgcatgatggagtamggtatcatccaa 
aaaaggttgataattamgcaaaatcatgtccttagtggtggtaatccgcgaaagj^ 
ctccmaacgcggttttattmctatmctattctcatgttgttccaaatat^^ 
aataatctaaattagmcgcttataattgatagtagtagaaagattggtgattctatf 

aaaatcctataattataaaunaatcaatamgcggtcactatttatttaaaacgttatgatcagtaggacacmgcatatatatagtt^ 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

CCTTAAATGATTATGTACAACTTfeTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTA 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTC^ 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 
ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaattgtgaaatamacctgcaattactgmcaaagagangtam 
TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 
CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccgg^ 
agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 
CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 
CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 
CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCT^ 
H aactaatactgttatccncataactaattttag^ 

p TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATA^ 

0 AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 
U1 ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 
,|S CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

01 CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 
Ik tattgtataaaamattaccactaacgattttaccagACCTCGAAACTT^ 

j»$ ATTATTTAATGAGTAACATAAAGgtaatatgccaaattttm^^ 

I CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 
□ AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATC I'l l 11 I I I ATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATT^ 
i I TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtanttaaagtattttttgcaaaaagctaatattttcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 
'tt CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtattaattmggtcatcaatgta 
t{ cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 
?M AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGg 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTT^ 
AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 
AGCTAC AA AA AACTTTGTTAGTG AGGCG 1 III CCTATTgtaagtttattttttcattggaattttttaacaaattcttttttagTTGATAT 
GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTC 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaangttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGA 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTT^ 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 
ATACTTTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAAC 
ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 
ATATGGGGAAATCri l l ITTTACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 
ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATAT^ 
GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
A A AG A ATGTTCATA ACGGgtgagtacttattttaactagaaaagtcattaattaaccttagATC i 1 I rGAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAGAGATGGTTTGAAACCCTCTT

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcattttcaattt 

tattactggtgtcttaaacaatattattactaagtatagctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattga 

atccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaacattaaaagtaatacatgaggctaa 

aataaggaaagtggttttctataatgaataatgcccgcactaatgcaaaaagacgaagattatc^^ 

acccagtgttgttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggttatccatgg 
gccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta^ 

gtgttaagccattattggattccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgatttaaa 

gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcgga 

gagtatctccagcggatccttgatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 
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Alignment row labels (repeated for each block of the alignment): EST2 pep; Euplotes pep; Trans of tetrahymen; Consensus 

FFYCTEISST VTIVYFRHDT WN KLIT I FIVE YFK-TYLVEN 

FFYVTEQQKS YSKTYYYRKN IWDVT-MKMS IAD LKK ETIA—E^E 

I-----KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQFSEK NKQMKNNFYQ 
FFY.TE. . K. .S . . YYYKK. IW...-KL F. -K V. . 



NVCRb^HNSY- 

KEVEEWKKSL 

KIQLEEENLE KVEEKLIPED SFQKY: 

v p . . . . . F 



GFAPCK0* 



GKfe; 



TL^NFNH^tiiW^^N^ 




HJPCRG 
tfTFNKK 
TTFLRK 

-TTF.RK 



ADEEBFTIYK ENHKNAIQP^^KILE^fkNK 
rVNSDRKTTK LTITHMJIN^HLMLJCI £ Q* - 
DKQKNIK LNIJ^II^S|^LW^|:<ti- 



FKQRI»LK K FN 



.K. .K LN.N. .L. .'S> :; QL-L. 



RPTSF7KIYS FTQIADRIKE 

RKFK -DPFGFAVFN 

ML-G -QXIGYSVFD 
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Consensus 
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.K- 
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S-1: FFY VTE TTF QKN RLF FYR KSV WSK 
S-2: RQHLKRVQLRDVSEAEVRQHREA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-1: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV 
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Map of the genomic clone (schematic labels) 

Vector / Genomic DNA Insert (A5) / Vector 
5825 bp Sequenced 
~2 kb Hind III Fragment 
Scale bar: 1 kb 

tez1+ 
RT Motifs: 1, 2, 3(A), 4(B'), 5(C), 6(D) 
Introns: 1 to 15 
Restriction sites: Hind III, Xca, Hind III, Bene 

Original PCR 
3' RT-PCR 
RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15BC 
RT-PCR w/ M2-B15BC 
RT-PCR w/ M2-B16 

Scale bar: 500 bp 
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Poly 4 

t t c 
ta a g c c tcg 
5'- cag acc aaa gga att cca taa gg -3' 
Q T K G I P Q G 

4(B') 

5(C) 

D D Y L L I T 
3'- ctg ctg atg gag gag tag tgg -5' 
a a a a a a a a a 
t t t t 
c c 

Poly 1 
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Motif B' (4) 

QTKGIPQG 



Motif C (5) 

DDYLLIT 
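
The two motif peptides above are the targets of the degenerate primers. As a rough illustration of why such primers need mixed positions, the sketch below back-translates a peptide and counts how many distinct coding sequences it allows. It assumes the standard genetic code (ciliate nuclear codes differ; for example, the Poly 4 primer reads taa as Q), and the names `CODON_TABLE`, `codons_for`, `degeneracy` and `translate` are hypothetical helpers, not anything taken from the figures.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (an assumption;
# ciliate nuclear codes reassign some stop codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codons_for(residue):
    """Return every codon that encodes the given one-letter residue."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == residue)


def degeneracy(peptide):
    """Number of distinct DNA sequences that encode the peptide."""
    total = 1
    for residue in peptide:
        total *= len(codons_for(residue))
    return total


def translate(dna):
    """Translate a DNA string (spaces and case ignored) in frame 1."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    for motif in ("QTKGIPQG", "DDYLLIT"):
        print(motif, degeneracy(motif))
    print(translate("gac gac tac ctc ctc atc acc"))
```

The counts grow quickly with leucine, serine and arginine (six codons each), which is one reason primers are usually placed on stretches rich in low-degeneracy residues such as D, Y, Q and K.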
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea_p123 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRVV 

Sc_p103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 



QKVGIPQG 

caa aaa gtt ggt atc cct cag gg 

Poly 4 

t t c 

ta a g c c tcg 

cag acc aaa gga att cca taa gg > 

ap] acc aaa gga att cca tea ggC TCA ATT 
tjQi tgg ttt cct taa ggt agt ccG AGT TAA 

(ail 

1 KGIPSGSI 

GAA GAT TTG ATT GAT GAA TAC CTA TCG TTT 
CTT CTA AAC TAA CTA CTT ATG GAT AGC AAA 

E D L I D E Y L S F 



< Actual Genomic Sequence. 



CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

LSSFLCHFYM 

ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA 
TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 

TKKKGSVLLR 



GTA GTC gac gac tac ctc ctc atc acc 
CAT CAG ctg ctg atg gag gag tag tgg 

V V D D Y L L I T 

< ctg ctg atg gag gag tag tgg 

a a a a a a a a a 
t t t t 
c c 

Poly 1 



gac gat ttc ctc ttt ata aca < Actual Genomic Sequence. 

D D F L F I T 
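
The comparison above sets the sequence targeted by the Poly 1 primer against the actual genomic sequence. A minimal sketch of that comparison follows; it takes only the single non-degenerate variant printed in the figure, so the mismatch count is an upper bound for the degenerate primer pool. The function names are hypothetical.

```python
# Complement table for lower- and upper-case DNA.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def complement(seq):
    """Base-by-base complement, same orientation as the input."""
    return seq.translate(COMPLEMENT)


def reverse_complement(seq):
    """Complement read in the opposite direction (5' to 3')."""
    return complement(seq)[::-1]


def mismatches(a, b):
    """Zero-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    target = "gacgactacctcctcatcacc"   # encodes D D Y L L I T
    genomic = "gacgatttcctctttataaca"  # encodes D D F L F I T
    print(complement(target))          # bottom strand as drawn, 3' to 5'
    print(reverse_complement(target))  # same strand written 5' to 3'
    print(mismatches(target, genomic))
```

The figure draws the primer 3' to 5' under the top strand, which is why `complement` rather than `reverse_complement` reproduces the printed line.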
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3' RT PCR Strategy 



Schematic: the 3' poly(A) tail of the mRNA is paired with the poly(T) stretch of the primer; the Q0 and QI positions are marked on the primer. 

1. Synthesis of cDNA with QT Primer. 
2. First Round PCR Using Outside Primer and Q0 Primer. 
3. Second Round PCR Using Inside Primer and QI Primer. 
4. Sequence Second Round PCR Products Using Inside Primer or QI Primer. 
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-Size Selected Libraries from P. Nurse 
3 - 4 kb 
5 - 6 kb 
7 - 8 kb 
11 - 12 kb 

-Libraries from J.A. Wise 
Sau 3A Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 
REP81ES Library (new) 
REP41ES Library 

Figure labels: P. Nurse; Original PCR; 3' RT-PCR; 500 bp; Mg Conc 
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A5 B2 



Size markers: 23K, 9.4K, 6.5K, 2.3K, 2K 

Hind III Digested Positive Genomic Clones 
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5' RT PCR Strategy 

Schematic labels: 5'; mRNA; poly(A) tail; ddA* 

1. Synthesis of cDNA with Specific Downstream Primer. 
2. Ligate Oligo with 5'-P and blocked 3' to cDNA using T4 RNA Ligase. 
3. First Round PCR 
4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tez1p (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)... 

S.c. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-IVYFRHDTW ...(35)... 

E.a. p123 (441). WIFEDLVVSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.p. Tez1p AVIRLLPKK--NTFRLITN-LRKRF ...(61)... 
S.c. Est2p SKMRIIPKKSNNEFRIIAIPCRGAD ...(62)... 
E.a. p123 GKLRLIPKK--TTFRPIMTFNKKIV ...(61)... 

* * * * * * * 

Motif 3(A) AF 

h hDh GY h 
S.p. Tez1p KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 
S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)... 
E.a. p123 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 



Motif 4(B') 

hPQG pF hh h 
S.p. Tez1p YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF ...(6)... 
S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8)... 
E.a. p123 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)... 

* * * * * * * 

Y Motif 5(C) Motif 6(D) 

h F DDhhh Gh h cK h 

S.p. Tez1p VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS .(205) 

S.c. Est2p LILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKA:^RDKILAVSSQS .(173) 

E.a. p123 LLMRLTDDYLLITTQENNAVLFIEKLINVSRENGFKFNMKKLQTSFPLS .(209) 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 
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1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 
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2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 
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Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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FIGURE 44 



An Example of Confirmation of tez1 Disruption by PCR 
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FIGURE 45 



Tez1 disruption causes progressive 
shortening of telomeres in S. pombe 
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FIGURE 46 



1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1MTEHHTPKSRILRFLENQYV 20 

1019 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
21 Y LCTLNDYVQLVLRGS PAS S 40 

1079 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 

41 YSNICERLRSDVQTSFSIFL 60 

1139 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
61 HSTVVGFDSKPDEGVQFSSP 80 
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FIGURE 46 (cont) 

1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 

81 K C S Q S E 86 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 

87 L I ANVVKQMF DES FERRRNL 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagaga 1405 

107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 

114 NHEDF RAM HVNGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 

129 D LVSTFPNYLISI LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 

149 L L L E I I G 155 

1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 

156 SDAMHYLLSKGS I FEALPND 175 

1662 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 

176 NYLQISGIPLFKNNVFEETV 195 

1722 TCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

196 SKKRKRTIETSITQNKSARK 215 

1782 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

216 E V S W N S I S I S R F S I F Y R S S Y 235 

1842 AAG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaattttag AT CTA TAT TTT AAC 1907 

236 K K F K Q D L Y F N 245 

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 LHSICDRNTVHMWLQWIFPR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 QFGLIN AFQVKQLHKVI PLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 SQSTVVPKRLLKVYPLI EQT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 AKRLHRISLS KVYNHYCPYI 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

326 D T HDDEKILSYS LKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

346 F LRS I LVRV F P K L I WGNQR I 365 



2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattt taccag AC CTC GAA ACT 2336 
366 FE IILKD LET 375 



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2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

396 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 KR SNAKMCLSDFEKRKQI FA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSF I I PILQSFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2645 

446 IT ESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA ATA AAC GAG 2705 

466 L LCRPFITSMKMEAFEKINE 485 

2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

486 NNVRMDTQKT 495 

2776 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 

496 TLPPAVIRLLPKKNTFRLIT 515 



2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 

516 N L R K R F L I K 524 

2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

525 MGSNKKMLVSTNQTLRPV 542 

2968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

543 ASILKHLINEESSGIPFNLE 562 

3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLLTFKKDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 KSC YDR IKQDLMFRIVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 

612KDPEFVIRKYATIHATSDRA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 T KNFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SDTLFVDFVDYWTKSSSEIF 679 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 

680 K M L K E H L S G H I V K 692 
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3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 IGNSQYLQKVGI PQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 

709 I LSSFLCHFYMEDLIDEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 F TKKKGSVLLRVVDDF LF I T 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 

749 V NKKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 I NFENSNGI INNTFFNESKK 798 

3901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 

799 R MPFFGFSVNMRSLDTLLAC 818 

3961 CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

819 PKIDEALFNSTSVELTKHMG 838 

4021 AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 

839 KSFFYKILRS 848 

4090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

849 SLASFAQVFIDITHNSKFNS 868 



4150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209 

869 CCNIYRLGYSMCMRAQAYLK 888 

4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274 

889 RMKDIFIPQRMFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 

918 EILGYTSRRFLSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 

947 P S FKYHPCFEQLIYQFQSLT 966 

4529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 

967 DL IKPLRPVLRQVLFLHRRI 986 

4589 GCT GAT TAA tgtcattttcaa tttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 

987 A D * 989 



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4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 4745 
4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 
4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4985 
4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 
5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 
5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 
5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 
5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 
5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 
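
The listing in Figure 46 prints exons in upper case and introns and untranslated regions in lower case, with codons that span an intron split across it. The sketch below shows one way to check a stretch of such a listing: drop the lower-case letters, join the upper-case ones, and translate. It assumes the standard genetic code, and `splice_exons` and `translate` are hypothetical helper names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (an assumption).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def splice_exons(listing):
    """Keep only upper-case letters (exon bases) from a mixed-case listing."""
    return "".join(ch for ch in listing if ch.isupper())


def translate(dna):
    """Translate upper-case DNA in frame 1, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Positions 1199-1332 of the listing, first intron included.
    first_intron = (
        "AAA TGC TCA CAG TCA GAG "
        "gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag "
        "CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG "
        "CGT CGA AGG AAT CTA"
    )
    print(translate(splice_exons(first_intron)))

    # A codon split by an intron (AT ... T) is rejoined by the splice.
    split_codon = (
        "GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC"
    )
    print(translate(splice_exons(split_codon)))
```

Running the same check over each numbered line is a quick way to spot places where the printed residues and the printed codons disagree.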




FIGURE 47 



l 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gln his leu lys arg val gln leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gln his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gln asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gln 

TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gln asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 200 
thr tyr cys val arg arg tyr ala val val gln lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 

210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gln cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 
gln gly ile pro gln gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250 260 
arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 
arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 
phe val gln met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gln ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340 350 
tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370 380 
arg leu lys cys his ser leu phe leu asp leu gln val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 
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390 

leu gln thr val cys thr asn ile tyr lys ile leu leu leu gln 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 



400 410 
ala tyr arg phe his ala cys val leu gln leu pro phe his gln 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gln val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 
arg leu pro leu leu leu his pro glu ser gln glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gln gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 
arg ala val ala val pro pro ser ile pro ala gln ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gln asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 
pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 
his pro gly leu met ala thr arg pro gln pro gly arg glu gln 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 

AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep p123 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tez1 ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

Hs TCP1 ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep p123 ...KSLGFAPGKLRLIPKKT--TFRPIMTFNKKIV... 

Sp Tez1 ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL... 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD... 

Hs TCP1 ...ARPALLTSRLRFIPKPD-GLRPIVNMDYVVG... 
consensus R PK RI 

AF 

Motif A h hDh GY h 

Ep p123 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tez1 ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP1 ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 
consensus F D YD 

Motif B hPQG pS hh 

Ep p123 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tez1 ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C h FDDhhh 

Ep p123 ...PNVNLLMRLTDDYLLITTQENN... 

Sp Tez1 ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

Hs TCP1 ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D Ghh cK 

Ep p123 ...NVSRENGFKFNMKKL... 

Sp Tez1 ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

Hs TCP1 ...LRTLVRGVPEYGCVV... 
consensus G 
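
The consensus rows in the motif alignment above keep a residue only where all four sequences agree. A minimal sketch of that rule, applied to a ten-residue window of Motif C, is given below. The window boundaries and the function name `strict_consensus` are choices made for this illustration, and the figure's own consensus may have been derived with looser criteria elsewhere.

```python
def strict_consensus(rows, gap="."):
    """Return the residue at each column where every row agrees, else `gap`."""
    if len({len(row) for row in rows}) != 1:
        raise ValueError("aligned rows must have equal length")
    return "".join(
        column[0] if len(set(column)) == 1 else gap
        for column in zip(*rows)
    )


if __name__ == "__main__":
    motif_c = {
        "Ep p123": "RLTDDYLLIT",
        "Sp Tez1": "RVVDDFLFIT",
        "Sc Est2": "KLADDFLIIS",
        "Hs TCP1": "RLVDDFLLVT",
    }
    for name, row in motif_c.items():
        print(f"{name:8s} {row}")
    print(f"{'strict':8s} {strict_consensus(list(motif_c.values()))}")
```

Only the two aspartates and the following leucine survive in this window, in line with the `DD L` consensus printed for Motif C.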
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FIGURE 50 



1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
35 1 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
40 1 ACCACCAGCG TGCGCAGCTA CCTGCCC AAC ACGGTGACCG ACGCACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
50 1 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
55 1 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
60 1 TC AGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
75 1 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
80 1 CCGTTGGGCA GGGGTCCTGG GCCC ACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
1 00 1 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACC AAGCA CTTCCTCTAC 
1 05 1 TCCTCAGGCG AC AAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCC AGCC TG ACTGGCGT TCGGG AGGTT CGTGGAGACA NTCTTTCTGG 
1151 TTCC AGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCC A 
1 20 1 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTC AAGA CGCACTGCCC GCTGCGAGCT 
1301 GCGGTC ACCC CAGC AGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1 40 1 CTGCTCCGCC AGCACAGC AG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
1 45 1 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTCAGGAAC ACCAAGA AGT TCATCTCCCT GGGGAAGCAT 
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1 60 1 CGCTTGGCTG CGCAGG AGCC C AGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
1 70 1 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAG ACCAC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1 80 1 TGCAAAGCAT TGGAATCAGA C AGCACTTGA AGAGGGTGCA GCTGCGGGAG 
1 85 1 CTGTCGG AAG CAGAGGTC AG GCAGC ATCGG GAAGCC AGGC CCGCCCTGCT 
1 90 1 GACGTCC AGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
1 95 1 TG AACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTG AGCTGTA CTTTGTC AAG GTGGATGTGA CGGGCGCGTA 
220 1 CGAC ACCATC CCCC AGGAC A GGCTCACGGA GGTCATCGCC AGCATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
235 1 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
255 1 GCTCCATCCT CTCC ACGCTG CTCTGC AGCC TGTGCTACGG CGACATGGAG 
2601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
265 1 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
280 1 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
285 1 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCCA TCAGAGCCAG TCTC ACCTTC AACCGCGGCT TC AAGGCTGG 
295 1 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCAC A 
300 1 GCCTGTTTCT GGATTTGC AG GTGAACAGCC TCCAGACGGT GTGC ACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GC AGCTCCCA TTTC ATCAGC AAGTTTGGAA GAACCCCAC A TTTTTCCTGC 
3151 GCGTCATCTC TGAC ACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCC AAG 
320 1 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
325 1 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
335 1 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
340 1 CGCAGCCAAC CCGGCACTGC CCTC AGACTT CAAGACCATC CTGG ACTG AT 
345 1 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
350 1 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
355 1 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
360 1 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
370 1 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
375 1 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
380 1 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC C ATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
390 1 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
395 1 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
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1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gln gly trp arg leu val gln arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gln cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gln val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gln arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 
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140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gln ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gln gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 
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290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gln his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala gly val cys ala arg glu lys pro gln gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 
440 450 
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

leu arg gln his ser ser pro trp gln val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gln ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile gly ile arg gln his leu lys arg val gln leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 



610 

ser glu ala glu val arg gln his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 




620 630 

leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 

arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 

gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gln asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 

tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gln asp arg leu thr glu val ile ala ser ile ile lys pro gln 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 

asn thr tyr cys val arg arg tyr ala val val gln lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 



760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 
770 780 
thr asp leu gln pro tyr met arg gln phe val ala his leu gln 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 



790 

glu thr ser pro leu arg asp ala val val ile glu gln ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 

ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 

gln cys gln gly ile pro gln gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 

ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 

880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 

leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 



920 930 
thr ala phe val gln met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 
940 

gly leu leu leu asp thr arg thr leu glu val gln ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 

ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 

val leu arg leu lys cys his ser leu phe leu asp leu gln val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gln thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 

leu gln ala tyr arg phe his ala cys val leu gln leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gln gln val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 

ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 

ser glu ala val gln trp leu cys his gln ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
1100 1110 
arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 